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Abstract: Both a- and (3-thalassaemia syndromes are public health problems in the 
multi-ethnic population of Malaysia. To molecularly characterise the a- and (3-thalassaemia 
deletions and mutations among Malays from Penang, Gap-PCR and multiplexed 
amplification refractory mutation systems were used to study 13 a-thalassaemia determinants 
and 20 (3-thalassaemia mutations in 28 and 40 unrelated Malays, respectively. Four 
a-thalassaemia deletions and mutations were demonstrated. — SEA deletion and a cs a 
accounted for more than 70% of the a-thalassaemia alleles. Out of the 20 (3-thalassaemia 
alleles studied, nine different (3-thalassaemia mutations were identified of which (3 E accounted 
for more than 40%. We concluded that the highest prevalence of (a- and (3-thalassaemia 
alleles in the Malays from Penang are — SEA deletion and (3 E mutation, respectively. 
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1. Introduction 

Thalassaemias are autosomal recessive disorders characterised by quantitative defects in globin 
chain synthesis. These biosynthetic defects can be classified according to the globin chain or chains 
involved in deficient synthesis. Best characterised are a- and (3-thalassaemia [1], which result from 
defective synthesis of alpha and beta chains, respectively [2]. 

Thalassaemia is one of the most common inherited disorders of haemoglobin [3] and is currently a 
public health problem affecting Malaysia's multi-ethnic population [4]. Studies have shown that 
3%-5% of the 28.3 million people in Malaysia are carriers of the thalassaemia gene [5,6]. The major 
ethnic groups are Bumiputras who make up the majority, followed by the Chinese and Indians [7]. 
In 2010, there were 4768 registered thalassaemia patients in Malaysia and between 600,000 and 
1,000,000 carriers of the thalassaemia trait [4]. Current statistics show that 1 in 20 Malaysians are 
carriers of the (3-thalassaemia trait [8]. It was further estimated that 120-350 infants are born each year 
with transfusion-dependent thalassaemia [8]. 

In most of the existing clinical set ups in Penang, the diagnosis of thalassaemia is based on data obtained 
from clinical findings, blood picture and haemoglobin analysis rather than molecular characterisation. 
It would be prudent to perform molecular characterisation for the genotype before clinical management 
of thalassaemia is begun. Characterising the most common a- and (3-thalassaemia mutations in the 
Malay population should make the subsequent diagnostic approach simpler, more cost effective and 
quicker by focusing on the small number of ethnically predominant alleles instead of a wide range of 
rare alleles. Since the frequencies of a- and (3-thalassaemia alleles vary considerably with geographic 
location and ethnic group, this study was performed to characterise a- and (3-thalassaemia mutations at 
the molecular level among Malay patients from Penang, a northern state of Peninsula Malaysia. 

2. Results 

A total of 56 a-globin gene haplotypes from Malay patients from Penang were characterised. As 
summarized in Table 1, two forms of deletions and two forms of mutations were detected out of the 
13 different a-thalassaemia determinants tested. The most frequent amongst them was — SEA , 
demonstrated on 20 a-globin haplotypes with a prevalence of 35.7%. This was followed by the chain 
termination mutant (a cs a) that was found on 30.3% (17/56) of a-globin gene haplotypes studied. The 
remaining two alleles, -a and Cd 59, were less frequently observed. These four determinants, along 
with the normal haplotype (aa), were observed to interact in six different genotypes as depicted in 
Table 2. A posteriori high frequency of — la a made up 46% of the genotypes. About 21% (6/28) 
were compound heterozygous for the — SEA /-a 3 7 a. Surprisingly, 4/28 (14.3%) of patients were 
compound heterozygous for the a a/ a a genotype; and 2/28 (7.1%) of patients were carriers of the 
a a/aa genotype. It was observed that 2/28 (7.1%) of patients were homozygous for the -a /-a 
genotype, while 1/28 (3.6%) of patients were heterozygous for the — SEA /aa genotype. The other 
deletion and non-deletion a-thalassaemia gene defects screened were not detected using the above test 
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methods. Some of the representative agarose gel electrophoresis are shown in Figures 1 and 2 for 
multiplexed ARMS-PCR and Gap-PCR, respectively. 

Table 1. Spectrum of a-thalassaemia gene defects identified in Malay patients by multiplex 
Gap- and ARMS-PCR. 



Alleles 


No. of Mutations 


Prevalence (%) 


SEA 


20 


35.7 


^Constant Springy 


17 


30.3 


-a 3 ' 7 


10 


17.9 


Cd 59 (GGC > GAC) 


6 


10.7 


aa haplotype 


3 


5.4 


Total 


56 


100 


Table 2. Prevalence of a-thalassaemia genotypes in Malay patients. 


Genotypes 


No. of Patients n = 28 


Prevalence (%) 


— SEA /a cs a 


13 


46.4 


-a 3 ' 7 /— SEA 


6 


21.4 


a Cd59 a/a cs a 


4 


14.3 


-a 3 ' 7 /-a" 


2 


7.1 


a cd 59 a/aa 


2 


7.1 


- SEA /aa 


1 


3.6 


Total 


28 


100 



Figure 1. Multiplex-Gap PCR genotype analysis of the a-globin gene cluster on agarose 
gel electrophoresis. Lanes 1 and 12 (left to right) show DNA marker ladders; 2350 bp band 
on lane 2 depicts the internal control and 1800 bp band indicates the presence of normal a2 

3 7 4 2 

globin gene. Lanes 3-5 are positive controls for heterozygous -a deletion, -a deletion, 
and — SEA deletion; Lanes 6-10 are patients where lane 6 shows compound heterozygosity 
for -a 3 7 and — SEA deletions; Lanes 7 and 8 "none of the deletion tested present"; 
Lane 9 -a 3 7 deletion; Lane 10 — SEA deletion; and Lane 1 1 is non- template control. 
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Figure 2. Multiplex ARMS PCR genotype analysis of a globin gene cluster on agarose gel 
electrophoresis. Lane 1 ladder; Lanes 2-4 are positive controls for Cd 59 (G > A), 
Cd 125 (T > C), and term Cd TAA > CAA (Hb CS) with their respective bands; 
Lane 5 non-template control; Lanes 6 and 7 Hb CS; and Lane 8 Cd 59 (G > A); 
930 bp bands on Lanes 2-4 and 6-8 are internal control bands amplifying a segment of 
3' UTR of LIS1 gene. 




Table 3. Frequency of P-thalassaemia mutations identified in 


Malay patients using 


Multiplex- ARMS. 






Mutations 


Percentage % (n) 


Total 


Heterozygous mutation 






p/pCd26 (G-A) 


55.6 (5) 




P^IVS 1-1 (G-T) 


22.2 (2) 




p/pCd 41/42 (-TTCT) 


22.2 (2) 


9 


Compound heterozygous 






pCd 26 (G-A) IVS 1-5 (G-C) 


32.1 (9) 




pCd 26 (G-A) ypIVS 1-1 (G-T) 


17.8 (5) 




pCd 26 (G-A) ypCd 41/42 (-TTCT) 


14.3 (4) 




pCd 26 (G-A) ypCd 17 (A-T) 


7.1(2) 




pCd 26 (G-A) ypCd 15 (G-A) 


7.1(2) 




pCd 26 (G-A) ypCd 8/9 (+G) 


3.6(1) 




pCd 41/42 (-TTCT) ypCd 19 (A-G) 


3.6(1) 




pCd 41/42 (-TTCT) y^-28 (A-G) 


3.6(1) 




pCd 41/42 (-TTCT)ypIVS 1-1 (G-T) 


3.6(1) 




pIVS 1-5 (G-C)ypIVS 1-1 (G-T) 


3.6(1) 




pIVS 1-5 (G-C) ypCd 19 (A-G) 


3.6(1) 


28 


Uncharacterised 




3 


Total 




40 



A total of 40 Malay patients with P-thalassaemia syndromes were enrolled in this study. From the 
80 P globin genes studied, nine different mutations were detected on 65 chromosomes. As depicted in 
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Table 3, nine patients were heterozygous for (3-thalassaemia, 28 patients were compound heterozygous 
for (3-thalassaemia and three patients remained uncharacterised by this procedure using the 20 different 
mutations described above. As summarised in Table 4, Cd 26 (G > A) and IVS 1-5 (G > C) were the 
two most common mutations observed and they accounted for a little over 60% of the patients. 
IVS 1-1 (G > T) and Cd 41/42 (-TTCT) had equal prevalence cumulatively accounting for a quarter of 
all the mutations observed. The remaining alleles, Cd 15 (G > A), Cd 17 (A > T), Cd 19 (A > G), 
Cd 8/9 (+G) and -28 (A > G) were rare alleles and were observed either on two chromosomes or 
on singletons. Capillary gel electrophoresis for the common (3 globin gene alleles is shown in Figure 3. 



Table 4. Distribution of P-thalassaemia mutations identified in Malay patients. 



Type of Mutations 


Number («) 


Prevalence (%) 


Cd 26 (G-A) 


28 


43.1 


IVS 1-5 (G-C) 


11 


17.0 


IVS 1-1 (G-T) 


9 


13.8 


Cd 41/42 (-TTCT) 


9 


13.8 


Cd 15 (G-A) 


2 


3.1 


Cd 17 (A-T) 


2 


3.1 


Cd 19(A-G) 


2 


3.1 


Cd 8/9 (+G) 


1 


1.5 


-28 (A-G) 


1 


1.5 


Uncharacterised 


6 




Wild type 


9 




Total 


80 


100 



Figure 3. Capillary electrophoresis (CE) electropherogram for multiplex ARMS-A of 
P globin gene cluster. Lanes 2-5 are the positive controls for 41/41 (-TTCT), 
IVS 1-5 (G > C), Cd 26 (G > A), and Cd 17 (A > T) respectively. Lanes 6, 9, 11, and 12 
show Cd 26 (G > A); Lane 10 shows "none of the mutations tested present"; and Lanes 7 
and 8 show the presence of IVS 1-5 (G > C). 



1 2 3 4 5 6 7 8 9 10 11 12 
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3. Discussion 

The reported frequencies of a- and P-thalassaemia mutations and deletions vary considerably 
between ethnic groups and with geographic locations [7,8]. Consequently, this study was performed to 
characterise the a- and P-thalassaemia determinants in the mainland of Penang where the majority of 
the population is Malay. Samples were taken from Malay patients who were previously diagnosed with 
a- or P- thalassaemia by the HPLC method at the Hospital Seberang Jaya (HSJ), Penang. All the 
68 patients had a mean corpuscular volume below 80 fL and mean corpuscular haemoglobin less than 
27 pg. 

The present study showed — SEA to be the most prevalent a-thalassaemia determinant in Penang, 
with a prevalence of 35%, followed by a cs a (30%) and - a 3 7 (18%). The inflated prevalence observed 
here clearly contradicts the nationwide prevalence reported for a-thalassaemia amongst Malays at 
large [9-11]. Several studies have shown plausible evidence that — SEA cis gene deletion as the most 
common amongst Malaysian Chinese, while this deletion is not as common as the - a allele in 
Malays [3,9-11]. On average, throughout the population, the — SEA deletion is characteristically four 
times more common in Malaysian Chinese than in Malays [3,10,11]. Moreover, Hb CS which was 
observed at a prevalence of 30% in this study, by contrast, has a nationwide frequency ranging 
between 1.1% and 3.6% (95%CI) in the ethnic Malays [11]. The most likely explanation for this 
remarkable overrepresentation of the a-thalassaemia genes in this study is the non-random nature of 
samples, representing symptomatic patients attending for regular follow-ups at Hospital Seberang Jaya. 
As is confirmed here, nearly 82% of these patients had HbH disease — out of which over 46% of 
patients had non-deletional HbH disease ( — SEA /a cs a); and 21% of patients had deletional HbH disease 
(-a / — ). Another 14% of subjects had genotype a a/a a presenting phenotypically as 
thalassaemia intermedia or HbH disease, presumably because haplotype <x Cd59 a has a 0 phenotype [2]. 
Therefore, the contact clinical sampling method adopted and the small sample size used for 
characterising the a-thalassaemia gene are likely to be the two main reasons for the discordance in 
allelic prevalence reported here. Furthermore, since no clinical disturbances are generally associated 
with either a + -thalassaemia homozygotes (-aha) or a 0 heterozygotes ( — /aa) these benign 
a-thalassaemia minor individuals are not well represented and neither are the alleles carried by them. 
The aforementioned genotypes are incidentally diagnosed during routine haematological examinations 
or as a result of family screening to diagnose a symptomatic thalassaemia in a relative. It is therefore 
necessary to obtain a simple random sample that is representative of the Malay subpopulation in the 
Penang district to consolidate these observations. The results were similar to a study from the 
Guangdong Province in Southern China. It was noted that the — SEA deletion was the most common 
mutation detected (48.54%) followed by the -a 3 7 deletions [12]. 

A diverse spectrum of nine P-thalassaemia alleles (Table 4) were demonstrated. 
Cd 26 (CAG > AAG), IVS 1-5 (G > C), IVS 1-1 (G > T) and Cd 41/42 (-TTCT) accounted for 86% of 
the mutations characterised. The highly skewed five rare alleles accounted for the remaining 14%. The 
pE(26Giu^Ly S ) mutat i on j s consistently the most common mutation among Malays, while Cd 41/42 
(-TTCT) is more specific and widespread in the Malaysian Chinese [9,13]. Invariably, Hb E 
(p 26Glu ^ Lys ) observed in this analysis at a prevalence of 35% is the most common P globin gene defect 
among Malays from Penang. This high prevalence is consistent throughout the Malay sub-population, 
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though considerable variation exists from one geographic location to another [13-16]. Characterisation 
of P-thalassaemia alleles on a contemporary multiethnic sample by Hassan et al. and George et al. 
revealed that HbE among the Malay ethnic group were 23% and 28.8%, respectively [9,13]. A similar 
study conducted a decade earlier by Tan et al. demonstrated the frequency of HbE in ethnic Malay was 
19% [6]. 

It is observed here that the prevalence of IVS 1-5 (G > C) among Malays from Penang is three-fold 
lower than the most common HbE allele. By contrast, Hassan et al. reported an equal distribution of 
both alleles among Malays throughout Malaysia [13], while Tan et al. reported a posteriori elevation 
of the former by an approximate ratio of 2:1 [6]. This empirical increase in the prevalence of HbE 
among Malays from Penang could be attributed to its clinical severity as the mutation is known to be 
associated with the (3°-thalassaemia phenotype, especially when the gene is in compound 
heterozygosity with a second P-thalassaemia allele ((3 B /p° or p E /p + ). In fact, all the 23 compound 
heterozygous patients with p E (Table 4) characterised here had a history of red cell transfusions at 
variable frequencies. This exemplified the clinical severity and disease heterogeneity of the disease 
when the Hb E allele is in transposition with another P mutation. Pathologically, this point mutation, 
commonly known as haemoglobin variant, simultaneously produces a structural and a quantitative 
defect in globin chain synthesis; hence, it is better known as thalassaemia haemoglobinopathy [17]. 

The third most common P-thalassaemia mutation characterised here is IVS 1-1 (G > T). As with the 
other observations, this mutation is prominently only found in Malays, among whom it has reached a 
high heterozygous carrier frequency and has given rise to a P° phenotype [12,14-16]. 

A relatively high prevalence of Cd 41/42 (-TTCT) is observed in Penang. This allele is prominent 
in the Malaysian-Chinese and less frequent amongst ethnic Malays in the general population [13,14]. 
This remarkable incongruity could be due to miscegenation between Malays and Chinese in Penang 
undergoing a demographic process that changes the allelic boundaries. This phenomenon could be 
acting on the distribution of many other thalassaemia alleles in this multiethnic subpopulation as 
Malaysians live side-by-side, breaking the norms of endogamy. Even a low level of gene flow over 
generations can lead to substantial changes, producing a heterogeneous pattern of allele frequencies. 

Interestingly, Cd 15 (G-A) and Cd 8/9, which are usually the rare types of P-thalassaemia mutations 
among Malays, are also observed in this study. In addition, Cd 15 (G-A) was found in the Thai 
population, at a prevalence of 0.3% [18]. In Malaysia, Cd 8/9 has been found in the Kedayan 
population, while Cd 15 (G-A) was found in Malaysian Indian patients but at a very low frequency [13]. 

4. Experimental Section 

4.1. Sample Collections 

A total of 68 Malay patients attending Hospital Seberang Jaya (HSJ), a part of the northern state of 
Peninsular Malaysia, were enrolled. A subset of 28 patients positive for HbH on high performance 
liquid chromatography (HPLC) or positive for HbH inclusion bodies on 1% Brilliant Cresyl blue 
(BCB) were recruited for a-thalassaemia gene characterisation. A second subset of 40 patients with 
hypochromic microcytic anaemia and elevated HbA 2 above 3.5% were enrolled for P-thalassaemia 
mutation screening. Prior ethical approval was obtained from the Human Research Ethics Committee 
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(JEPeM), Universiti Sains Malaysia and Research and Ethics Committee (MREC, Kuala Lumpur, 
Malaysia), Ministry of Health. Approximately 2 mL of whole blood was collected into EDTA 
vacutainer (Bio Lab, Shah Alam, Malaysia) from each patient after obtaining informed consent. 

4.2. DNA Extraction 

Genomic DNA was extracted from 200 |uL whole blood using a commercially available DNA 
extraction kit, QIAamp DNA Blood Mini Kit (Qiagen GmbH, Hilden, Germany). The concentration 
of DNA was determined by NanoDrop Spectrophotometer ND-1000 (NanoDrop Technologies, 
Wilmington, DE, USA). The extraction was carried out according to the manufacturer's instructions. 
To perform PCR in this study, a concentration of DNA around 50 ng/|iL was required. 

4.3. Molecular Analysis for a- and fi-Thalassaeniia Alleles 

a-thalassaemia genotypes were tested using 13 different determinants commonly found in Malaysia 
and those reported from the Southeast Asia region. The panel included four double a globin gene 
deletions ( — SEA , — MED , — FIL , and — THAI ), three single gene deletions (-a 3 7 rightward deletion, -a 4 2 
leftward deletion, and -a 20 ' 5 ) and six non-deletion a2 globin gene mutations namely initiation codon 
(ATG > A-G), codon 30 (AGAG), codon 35 (TCC > CCC), codon 59 (GGC > GAC), codon 125/Hb 
Quang Zhe (CTG > CCG) and termination codon/Hb Constant Spring (TAA > CAA). 
In-house optimised multiplexed Gap-PCR and multiplexed amplification refractory mutation system 
(M-ARMS) methods were used in parallel for gene deletion and mutation testing, respectively [1 1,19,20]. 

For P-thalassaemia genotyping, 20 different mutations were tested: 19 by M-ARMS and one by 
simple ARMS technique. Genotyping of the 19 P-thalassaemia determinants were carried out in five 
separate multiplexed PCR reactions based on the shared thermo-cycling conditions of respective 
allele-specific primers as described by Hassan et al. (2013) with slight modification [13]. In the first 
M-ARMS-A reaction, allele specific primers for four mutations IVS 1-5 (G > C), Cd 41/42 (-TTCT), 
Cd 17 (A > T) and Cd 26 (G > A) were multiplexed, while IVS 1-1 (G > T), Cd 8/9 (+G), -28 (A > G) 
and Cd 71/72 (+A) mutations were amplified in M-ARMS-B reaction. In the third M-ARMS-C 
reaction, alleles IVS 1-1 (G > A), Cd 43 (G > T), Cd 16 (-C), and Poly A (A > G) were multiplexed, 
while M-ARMS-D had allele-specific primers targeted at -88 (C > T), initiation codon (ATG > AGG), 
Cd 15 (G > A) and -29 (A > G) mutations. Finally, allele specific primers used in M-ARMS-E 
reaction were to screen mutations -86 (C > G), Cd 19 (A > G) and Cap + 1 (A > C) [13]. IVS 2-654 
(C > T) was tested in a separate ARMS reaction. 

4.4. Gel Electrophoresis 

The PCR products for a-thalassaemia were analysed by using agarose gel electrophoresis. 
PCR products of multiplex gap PCR were run on a 1.5% agarose gel electrophoresis using 
lx Tris-Borate-EDTA (TBE) buffer. Whereas, PCR products for a-thalassaemia from multiplex 
ARMS PCR were run on a 1 .2% agarose gel. 

The PCR products from the (3 globin gene, except for reaction in MARMS-A, MARMS-B and 
MARMS-F, were separated using 2% agarose Dl, Low EEO Pronadisa (Laboratorios CONDA, 
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Madrid, Spain) in IX TBE buffer (Biobasic, Markham, ON, Canada). DNA band was visualized under 
transilluminator (Vilber Lourmat, Sud Marne-la-Vallee, France). QIAxel Advanced System (QIAGEN 
GmBH, Hilden, Germany) was used to analyse MARMS-A, MARMS-B, and MARMS-F which used 
automated capillary electrophoresis to separate the products based on their sizes and visualised on the 
interfaced computer system using its dedicated QIAxcel ScreenGel Software (QIAGEN GmBH, 
Hilden, Germany). 

5. Conclusions 

In this study, we have demonstrated the a- and (3-thalassaemia gene frequency in the Penang 
population and further described its allelic distributions in the Malay populations. The heterogeneity of 
the a globin defects in the present study population is different from the other states in Malaysia as 
well as the presence of rare types of P-thalassaemia mutations. This may be attributed to a unique 
situation where there is a higher proportion of miscegenation among Malay with other ethnicities. 

The results from this study will serve as a baseline for further investigations into the genetic defects. 
This information would provide health care professionals better awareness of the possible clinical 
spectrum of a- and (3-thalassaemia in this geographic locality, thus facilitating better diagnostic and 
management of the disease. The prevention of severe a- and (3-thalassaemia syndrome is very much 
dependent upon the availability of molecular characterisation, supported by adequate genetic 
counselling, and targeted public awareness programmes. The molecular diagnosis of a- and 
(3-thalassaemia patients followed by genetic counselling of at risk couples should be made available as 
it is essential for the accurate diagnosis of both carrier and disease states. 

In addition, such an effort would reduce the economic burden, and comprehensive and effective 
management of this problem in our country will be better achieved. Thus, a coordinated and interactive 
collaboration between the relevant stakeholders is necessary to ensure the effectiveness and success of 
the National Prevention and Control Programme for Thalassaemia in Malaysia. 
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